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We propose a methodological framework to study the dynamics of inter-regional 
investment flow in Europe from a Complex Networks perspective, an approach with 
recent proven success in many fields including economics. In this work we study the 
network of investment stocks in Europe at two different levels: first, we compute the 
inward-outward investment stocks at the level of firms, based on ownership shares 
and number of employees; then we estimate the inward-outward investment stock at 
the level of regions in Europe, by aggregating the ownership network of firms, based 
on their headquarter location. Despite the intuitive value of this approach for EU 
policy making in economic development, to our knowledge there are no similar works 
in the literature yet. In this paper we focus on statistical distributions and scaling 
laws of activity, investment stock and connectivity degree both at the level of firms 
and at the level of regions. In particular we find that investment stock of firms is 
power law distributed with an exponent very close to the one found for firm activity. 
On the other hand investment stock and activity of regions turn out to be log-normal 
distributed. At both levels we find scaling laws relating investment to activity and 
connectivity. In particular, we find that investment stock scales with connectivity in 
a similar way as has been previously found for stock market data, calling for further 
investigations on a possible general scaling law holding true in economical networks. 
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I. INTRODUCTION 

In this work we study the network of investment stocks in Europe at two different levels 
of graining, a finer and a coarser level. We start by studying the network of investment 
stocks between individual firms and we then proceed to study the network of investment 
stocks between European regions. At each level we focus on two specific aspects of such 
networks: on one hand the statistical distributions of activity, investment stock and con- 
nectivity degree. On the other, the scaling relations between such quantities. In this 
respect this work is related to previous ones in Complex Networks, in economic networks, 
and also in scaling laws in industrial economics. 

Several authors within the Complex Networks scientific community have started to 
focus today on the geographical aspects of empirical networks like for instance the world 
wide air traffic network 

Closer to economics, some authors have studied the so called World Trade Web 
(WTW), i.e. the network of import-export trade among countries in the world 
Also, statistical properties of ownership networks have been studied so far only in a few 
works, and in any case with no focus on geographical aspects. To name a few of the 
works in this area, a study on the firm ownership network of Germany found it to exhibit 
small world properties |15]. In the US stock markets, power law distribution of connec- 
iivity degree and scaling relation between degree and invested volume have been found 
^. However, it has been observed that network structures may differ from each other in 



rom the point of view of the degree 

a. 



terms of control concentration and still look similar 
distribution or the investment volume distribution ^|. 

In the economic literature, some works consider geographical embedding of economic 
networks. It has been argued that dontestic nvahy and _hic industry concentration 
are especially important in creating dynamic clusters |16/]. From a more general perspec- 
tive, some authors assume the existence of a global world-economy in which, since its 
inception in the sixteenth century, the periphery is assigned the function of supplying 
the core with cheap labor and raw materials, while the core has the role of producing 
manufactured goods, which require capital .ntcns-ve high technology Q. 

Statistical distributions in firm demography are well documented: firm size IJJ,^JJ, firm 

n n 

growth fq, firm debts [10|| have been studied in several countries and contexts. However, 
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in general, interactions are poorly considered in empirical studies. Firms are not isolated, 
but rather depend on the interaction with other ones through supply, ownership, part- 
nership relations etc. Such relations should then play a role in the statistical properties 
observed at a macroscopic scale. Also, there exists an interaction between firms and banks 
through loans, and it has been found that the distribution of the so called "bad debts" 
(debts of firms towards the bank when they go bankrupt) is power-law distributed. An- 
other type of interaction which is not well documented are the investment stocks of firms 
in other firms. In this paper we find a power law distribution for investment stock with an 
exponent very close to the one found for firm activity (measured in number of employees, 
see section |n}. Such a finding suggests a scaling law between investment and activity 
that we also investigate and discuss. It is relevant to mention here the relationship be- 
tween empirically observed macroscopic distributions and possible microscopic processes 
that may be responsible for them. In literature about the emerging of scaling laws in 
firm demography, power law distributions are usually obtained as a result of a random 
multiplicative process affecting the size of each firm. Now, it is known from the works 
of Kesten in the 70'ies that under some general conditions, a combination of random 
multiplicative and additive process can give rise to power law distributions ^M]- Even a 
random multiplicative process with a lower reflecting barrier (representing for example a 
bankruptcy threshold below which a firm disappears and a new one is created) gives rise 
to a power law distribution. Instead, a pure random multiplicative process gives rise to 
a log- normal distribution [2^. However, in all these processes, interaction between firms 
is not considered. Among the few works addressing the issue, we mention one in which 
a firm bankruptcy affects indirectly other firms through the interest rate of the central 
bank In this paper we start filling such gap by focusing on the distribution and the 
scaling properties of investments of firms in other firms. Our aim is to contribute to the 
understanding of how interdependency between firms gives rise to well defined statistical 
distributions both at the level of firms as at the level of regions. 
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A. Foreign Direct Investments and Inter-Regional Direct Investments 

Concerning investment there is a tendency to study investment stocks or flows between 
countries, referred to as Foreign Direct Investment (FDI), without considering a higher 
resolution. Contrary to this tendency, in this paper, we focus on investment stocks be- 
tween regions of Europe, which can be referred to as Inter-Regional Direct Investment 
(IRDI). In this sense, as discussed later, some concepts will be borrowed from the FDI 
literature and applied in the IRDI context. The statistical characterization of the network 
of IRDI stocks in Europe is a first step towards relating investment flow patterns at a 
global level to local and regional dynamics. 

FDI is defined as " investment that adds to, deducts from or acquires a lasting interest 
in an enterprise operating in an economy other than that of the investor" the purpose 
is to have an "effective voice in the management of the enterprise", as equivalent to 



holding 10% or more in the foreign enterprise 22|. Foreign affiliates are made up of 
subsidiaries, associates, and branches. Subsidiaries are majority- or wholly-owned by the 
parent companies. Associates are companies in which the investing firm participates in the 
management but does not exercise control. Branches are permanent establishments set 
up by the parent company in which there is no equity share capital apart from that of the 
parent. For associates and subsidiaries, FDI flows consist of the net sales of shares and 
loans (including non-cash acquisitions made against equipment, manufacturing rights, 
etc.) to the parent company plus the parent firm's share of the affiliate's reinvested 
earnings plus total net intra-company loans (short- and long-term) provided by the parent 
company. For branches, FDI fiows consist of the increase in reinvested earnings plus the 
net increase in funds received from the foreign direct investor. FDI fiows with a negative 
sign (reverse fiows) indicate that at least one of the components in the above definition 
is negative and not offset by positive amounts of the remaining components. However, 
the magnitude of FDI flow can also be measured as number of jobs created or increased. 
Contribution to more favorable employment status in the host country is critical when 
evaluating FDI, especially for countries that are battling high unemployment rates or 
want to increase the quality of their workforce. When quantifying FDI, "flow" (function 
of time) and " stock" (cross-sectional/cumulative) are measured and headquarter location 
of the investor plays a critical role jlq . 
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Due to standard legal, institutional and policy attributes that are common across a 
given country it indeed makes sense to focus on FDI. However, nowadays firms may 
perceive some regions in another country as more similar than regions within national 
borders. This might be revealing that the process of European integration has reduced 
the national specificities perceived by multinationals and that regions now are competing 
to attract FDIs more across than within countries. Therefore it is important to gain insight 
on investment flows at a higher spatial resolution. In this paper we define Inter-Regional 
Direct Investment (IRDI) flow in analogy to FDI, as the flow between two administratively 
separated regions irrespective of their countries. 



B. Relevance of FDI/IRDI for the Global Economy 

Just like FDI, the study of IRDI has prominent policy making implications, in par- 
ticular for governing bodies trying to tackle economic growth and employment creation 
2^ . There are certain general factors that consistently determine which countries/regions 
attract the most investment j^. In particular, investors cite the following: market size 
and growth prospects of the host, wage-adjusted productivity of labor, the availability of 
infrastructures, reasonable levels of taxation and the overall stability of the tax regime. 
Recent crises have magnified perceptions of regulatory risks and greater attention is now 
being focused on the legal framework and the rule of law. Thus the decision process 
in investment is multi-factorial whereas the success or a higher productivity of the in- 
vestment holds only when the host country/region has a minimum threshold stock of 
human capital. Thus, investment contributes to economic growth only when a sufficient 
absorptive capability of the advanced technologies is available in the host economy jl7|. 
Promotional efforts to attract investment have become the focal point of competition 
among developed and developing countries. This competition is maintained even when 
countries are pursuing economic integration at another level. And it also extends to the 
sub-national level, with different regional authorities pursuing their own strategies and 
assembling their own basket of incentives to attract new investments. While some see 
countries lowering standards to attract FDI in a "race to the bottom," others praise FDI 
for raising standards and welfare in recipient countries. The targets for these promotional 
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efforts are dominantly the major players of FDI, namely the Transnational Corporations 
(TNC's) who on the other end push for newer markets (in the last decades the global 
trend of privatization has been a very important mean towards this end). Public interest 
driven policies meanwhile continue to serve as the balancing force, and sustainable de- 
velopment concerns are dealt with accordingly. The EU has attracted over 40% of total 
world flows of FDI in the 1990s, becoming the largest recipient of multinational activity: 
multinationals account for a growing share of gross fixed capital formation in Europe 
(from 6% in 1990 to over 50% in 2000). However, this increasing inflow of FDI in Europe 
has not been equally distributed across countries and regions j3|. 

Now, while statistics are mostly regarded at the country-to-country scale, investments 
are actually made in specific regions with geographic features, local administration con- 
straints, and cultural profile. There seem to be a substantial gap to fill in understanding 
what is the role of region-to-region investment flow in the global economy. For this reason 
we hereby propose to define the IRDI stock network and we investigate some of its sta- 
tistical properties as a first stage of a more comprehensive study to be continued in the 
future. In section |n] we describe the data set analysed and discuss some methodological 
issues. In section IIIII we present and discuss the results of our analysis. In section |3 we 
draw the conclusions and list some possible extensions of the present work. 



II. DATA SETS AND METHODS 



In this section we first describe the content of the firm database we used for our 
analysis. We then introduce the quantities we have measured on the data set and include 
some methodological remarks. 

For the firm information we used data collected in December 2004 from the Amadeus 
database of Bureau Van Dijk. Access to the on-line service of Amadeus is costly and 
was kindly granted by Prof. Delli Gatti of Universita Cattolica di Milano. The database 
provides firm address, financial profile, number of employees, industrial classification, 
names of shareholders and board of directors of virtually all firms in Europe. 

However data are delivered in files of limited size and we were forced to restrict the 
download to a selection of the ensemble of firms. We have chosen to select the firms with 



The Network of Inter- Regional Direct Investment Stocks Across Europe 



7 



number of employees larger or equal to 100. Also, a small percentage of around 1% of 
firms was lost in this selection because the number of employees was not available. The 
original data set consists of 181.945 firms from 39 European countries uniquely identified 
by their Bureau Van Dijk identification number (BVDID). For a given firm, shareholders 
can be individuals/families, governments or other institutions that are not listed as firms. 
Even when they are firms they may have only indirect ownership through intermediate 
firms. Moreover, they may have less than 100 employees and therefore location and profile 
for them is not available in our data set. The set of firms which are involved in ownership 
links with other firms includes 47.621 firms. We have chosen to restrict our analysis to 
the direct ownership network of firms with more than 100 employees, implying that we 
have the financial profile and geographical location of all the firms involved in this set of 
ownership links. This is the set of data we use for the network analysis and it includes 
29.314 firms and 22.174 links. 

The selection of a subset of the ownership links induces of course an underestimation of 
the total hosted investment stock. Still, investigating how investment size among this set 
of firms is distributed in the network and among geographical regions is a very interesting 
point to address. 

A. Defining the Quantities of Interest 

For each firm i we consider the following quantities: the activity measured as number 
of employees in the firm, the shares Wij of firm i owned by any other firm j, and the 
headquarter region Ri of firm i. The number of employees is one of the standard quantities 
used to measure firm size l] , and in the following we will measure also investment stocks in 
terms of number of employees. There are of course other possible measures of investment 
stocks, based on capital rather than on human resources, but from the point of view of 
labour market and economic impact at a local scale, it is relevant to have an estimate of 
how many employees of a firm or a region depend on the investment coming from outside. 

Shares are usually defined as a percentage, but it is more convenient to define Wij 
as a fraction of ownership and, therefore, as a real number in [0, 1]. Not all shares are 
necessarily held by entities external to the firm. Moreover some entities are not firms and 
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FIG. 1: Diagram illustrating how the (right) region network Gr is built up from the (left) firm 
network Gp. 

we only look at shares held by firms, as discussed above. Therefore it holds: 



If we take the number of employees as a measure of activity of a firm, it is natural to 
compute the quantity Sij-. 



that represents the investment stock of firm i held by firm j. 

We can define the firm network as a graph Gp = (VfjEf) Figure ^ left) where Vp 
is a set of nodes representing firms and Ep is a set of directed edges between nodes. 
An edge represents the fact that firm j owns shares of firm i. The order of the 

pair in this notation is natural for the layout of most databases of firms and we adopt 
it. However, keep in mind, that ownership and investment have opposite directions. In 
fact, it is more natural to define in-degree and out-degree of connectivity with respect to 
investments rather than to ownership. We define as in-degree km of a firm the number of 
firms investing in i (holding shares of i). Similarly we define as out-degree kout of a firm 
the number of outside firms in which firm i invests. The connectivity degree or simply 
degree of a node is the number of edges entering or departing from that node. 




(1) 



WijQii 



(2) 
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We can associate to each edge the normahzed weight Wij, representing the amount 
of shares that firms j owns in i. But we can also associate the absolute weight Sij. We 
then define the inward investment stock a-" of firm i , as the total stock invested in firm 
i by other firms and outward investment stock a°"*, as the total stock invested by firm i 
in other firms: 

ar = J2s,, aT' = Y.s,, (3) 

3 k 

We can now define analogous quantities aggregated by region. The activity of region 
m is defined as the sum of the activity of the firms with headquarter in that region. 

A^ = ^ai (4) 

In other words, Am is the total number of workers employed by firms of that region. 
Keep in mind that we analyse only a subset of all firms and that therefore, the activity 
of a region can be much smaller than the number of individuals employed in that region. 
The sum of the investments made by firms of region n in firms in region m is defined as: 

Smn ^ ^ ^ij (5) 

It can be seen both as the outward investment stock of region n in region m or as the 
hosted investment stock in region m coming from region n 

It is very natural at this point to define the region network as a graph Gji = [Vr, Eh) 
where nodes represent regions and a directed edge (m, n) from region m to region n 
represents the fact that some firms of region n own shares in some firms of region m. The 
diagram in figure ^ illustrates the procedure of building the network of regions. Small 
circles represent firms with their associated values of activity. Edges represent ownership 
relations. Larger circles represent regions in which firms have their headquarters. The 
edges in firm network among all firms in regions m and n sum up to form an edge between 
m and n in the region network. We associate to the edge (m, n) the absolute weight Smn- 
The degree is defined as for the firm region and in particular in-degree and out-degree are 
defined with respect to investments. 

The sum of the investments made in firms of a region by firms of any other region will 
be called inward investment stock of region m (eqEI)- In the following we will refer to this 
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quantity also as hosted investment stock. Similarly, the sum of the investments made by 
the firms of a region in firms of other regions will be called outward investment stock of 
region m (eqlHI). 

n n 

Both definitions are chosen in analogy with the terms used in the literature about Foreign 
Direct Investments. But instead of looking at investments between different countries, we 
increase the spatial resolution to the level of regions. 

As only the few most important shareholders of each firm are usually listed in the 
database, the in-degree of firms is a bias quantity with little meaning for our purposes. 
However, when aggregating by region, the in-degree of a region represents the number 
of other regions in which the top shareholders of firms in the focal region have their 
headquarters. This quantity is not limited a priori and as such it makes sense to study 
its distribution. 



B. Measuring the Quantities of Interest 

Our perspective in this work is to try and relate the microscopic and macroscopic 
aspects of the network of investments between firms and between regions. We want to 
look at statistical distributions and not at single or average values, the aim being to try 
and understand what is the individual tendency that builds up the macroscopic properties. 
Therefore we will focus on the probability distributions of the quantities defined above. 

In order to study the probability distribution (pdf) of a variable x it is useful to plot 
its cumulative distribution function (cdf) defined as 

Pc{x) = / p{x)dx (7) 

J x>x 

where x p{x) is the probability distribution function. In words, the cdf gives the 
fraction of a randomly chosen sample of the variable x that lies above the value x. A 
simple way of constructing P{x) is the following. Consider the vector x of real numbers. 
We rank x in ascending order. Clearly, now all values are larger or equal to the first data 
point. So the probability distribution starts from 1 and decreases. The k-th component 
of the vector x has ascending rank k and there are N — k values larger or equal to x{k). 
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The fraction of data larger or equal to x{k) is N — k/N. We therefore simply plot the 
pair {x{k), N — k/N) for all k. If some values of x are repeated and in particular if x is a 
discrete variable, then the plot will display 'stairs'. In this case it is preferable to count 
the fraction P of data that are larger or equal to each value x and then to plot P versus 

X. 

If the distribution of the variable a; is a power law with exponent —7, then its cumula- 
tive distribution is still a power law with exponent —7 + 1. In log — log scale they appears 
as straight lines with different slope: 

p{x) = cix~^ (8) 
P{x) = C2X-^+^ (9) 

where ci,C2 are normalization factors. If instead the distribution of the variable x is log 
normal with coefficients fi and a (eq. irUj) . then it appears as a quadratic curve in log — log 
scale. 

f \ 1 / (logx-/x)^ 

= /t; — TT^ (10) 

\og{p{x)) ~ - log(x) - ^^"^^J^^' (11) 

However, its cumulative distribution does not have an analytical expression. 

It is usually more accurate and safe to estimate the exponent from the cdf rather than 

from the pdf, because of the fluctuations in the frequency of high values of x. This is 

usually done by fltting the cdf in log — log scale with a line and computing the slope, 
iowever, it has been recently remarked that this method introduces a systematic bias 
1^ . An alternative method that doesn't make use of a graphic flt is described in isl ]. 

The formula for the exponent is 



7 = 1 + 



X{ 
^min 



(12) 



where x^in is the lower limit of the range of data following the power law. Confidence 
intervals for 7 can be computed with the standard bootstrap technique. 

When a relation holds between two variables, then their respective probability distri- 
butions are also related through the equation: 

p{y) = p{f{x)) = pi^)^ (13) 
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An important consequence is that if, x is power law distributed and y scales as a power 
law of X, then y is also power law distributed, and a relation holds for all the exponents 
involved (power law distributions are closed with respect to the operation of power law 
rcscaling). If x is log normal distributed, and y scales as a power law of then the 
distribution of y converges to a log normal function for large y. This observation implies 
that, if two variables are both power law distributed, and we suspect that a relation might 
hold between them, such relation could be a power law scaling, therefore we have to be 
careful while checking for possible correlations. 

In fact, in the study of the correlation between quantities, related to firms and regions, 
that span several order of magnitude and have to be studied in log — log scale, we proceed 
as follows. Consider the variables (X, Y) for instance. We compute the log 10 of both 
variables and produce a scatter plot of {X,Y) — (log 1 OX, log 1 OF). We then divide the 
X axes in k bins of equal size. For each bin centered in the value Xj, we compute the 
mean y^ and the standard deviation (Ty^ of the values of Y for which the corresponding 
abcissa falls in the bin k. We obtain a new set of data points (x, y) that allows to better 
display the trend of the original data (X, F). A linear fit is then performed on the {x^y) 
data points, and values of slope, intercept and correlation coefficient are computed. We 
remind that a linear relationship between x and y implies taking the exponential, a power 
law relationship between the original variables: 

Y^m-X + q (14) 
log 10(y) = m • log 10(X) + ? (15) 

Y = C-X'^ (16) 

where C = 10^. Of course, while the operation of binning and averaging over bins is 
a standard procedure one should be careful in this such operation does not 

commute with the operation of taking the exponential. However, we are not aware of 
any documented bias introduced by this procedure and if the fit is reasonably good we 
conclude that Y is scaling as a power law of X and we take m as the exponent of the 
scaling law. 
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III. ANALYSIS AT THE LEVEL OF FIRMS 

We first report the cumulative frequency distribution of activity and investment stock 
of firms. We also investigate how investment stock scale with firm activity and with firm 
connectivity degree. We then report the analogous results for regions where activity, in- 
vestment stock and connectivity degree of regions are defined as in section Hi Al Similarly, 
we investigate how investment stock scale with region activity and region connectivity 
degree. 

A. Firms. Distributions of activity,investment and connectivity degree 

In figure 121 we report the cumulative distribution (cdf) of activity and investment stock 
of firms computed from our data set. The onset at the value 100 for the activity is 
simply due to the restriction of the data set to firms with more than 100 employees. 
Investment stock can of course take smaller values as it is measured as a fraction of the 
number of employees per firm. The cumulative distributions display a linear decay over 3 
decades or more. However some 'bumps' deviating from linearity are visible. The curves 
can either be fitted with log normal distributions with very large standard deviation 
or can be reasonably fitted with a power law. In both cases the meaning is the same: 
the probability of finding large firms is decreasing approximately as the exponent of the 
power law fitting the curve. Computing the exponent from the linear fit is known to 
introduce bias hence following a known procedure the values of the exponents 
and confidence intervals were computed as described in|nl The values of the exponents 
and their confidence interval are reported in table HI 

Our finding concerning the activity is in line with what is generally known in the 
literature for firm size distributions of many countries and historical epochs. It implies 
that firm activity is very heterogeneous and that roughly speaking, very large values and 
very small values of activity are much more frequent that in normal distributions. We 
remind the reader that the data set analysed includes only firms involved in an ownership 
relationship in Europe and not all firms indiscriminately. However, the value of exponent 
7 is not far from results obatined in previous studies. For instance Axtell reports 2.056 
for the US firm activity distribution Q|. Fujiwara et al. report 1.995 for UK based on 
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TABLE I: Firms. Power law fit values for Cumulative Distributions. 

Data 7 
a 1.7829 0.0038 
a'"^ 1.9307 0.0064 
^out 1,7684 0.0061 
gin+out 1 8480 0.0047 
3.849 0.044 

data from Amadeus Database On the other hand, the fact that investment stock 
distribution is also a power law is to our knowledge a novel result. In particular, the 
values of the exponent 7 for activity and outward investment stock are very close. 




FIG. 2: Firms. Cumulative distribution of: (left) activity a (o); hosted investment stock a*" (x); 
outward investment stock a°"* (+); total investment stock 0**^+""* (A) and (right) normalized 
values with respect to the maximum value 

This finding might suggest that a scaling relation holds between the two variables (as 
discussed in section 111 B|) . However, this hypothesis had not been empirically verified so 
far in the literature and we don't know a priori to what extent it holds. We will then 
investigate its validity in the next section. 

As is usual in the study of networks, we report the cumulative distribution functions 
(cdf) of the degree of connectivity of firms (figure El). We distinguish between (total) 
degree, in-degree and out-degree as defined in III Al The distributions span a short range 
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of less than two decades. The out-degree display a linear decay in log — log scale that can 
be fitted by a power law with exponent 3.85, which is a little larger than the typical values 
observed in many empirical complex networks (typically in the range [1.8 3]). The result 
implies that the number of connections between firms is moderately heterogeneous and 
decreases slower than exponentially. The curve for the in-degree instead is not meaningful 
as mentioned in section 111 Bl and simply shows how many records of shareholders are 
available per firm. The linear fit was performed just for sake of completeness. 




FIG. 3: Firms. Cumulative distribution of connectivity degree. In-degree (x) , out-degree (-I-) 



However, it is important to remark that the connectivity out-degree represents the 
number of ownership relations in which the firm is involved, regardless of the size of 
shares involved in each relation. Because the degree doesn't take into account the size 
of the shares, a large out-degree doesn't mean that a firms really controls a lot of other 
firms. Alternative quantities are needed to characterize the ownership concentration such 
as those introduced in j^^] and they will be applied to the present data set in a future 
study. 

B. Firms. Correlations among activity, investment and connectivity degree 

In order to understand why investment of firms are power law distributed and why 
distributions of outward investment and activity have exponents very close to each other 
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we investigate the correlations between activity, investment and connectivity out-degree. 
The plots in figures El - El are produced by taking the log 10 of the quantities and binning 
the data on the x axes as described in section III Bl 




FIG. 4: Firms. Plot of (left) hosted invested stock a*" and (right) outward invested stock a°"* 
of firms versus their activity a in log — log scale. Data are binned. Mean zt standard deviation 
of the values in each bin are plotted as the continuous lines. 



Table IHl reports the value of slope and correlation coefficients for the linear fit of the 
binned data. The correlation coefficients are all quite close to 1 so, with the caveat 
mentioned in section Hi Bl we conclude that the data indicate the existence of scaling laws 
between investment and activity and between out-degree and activity. 

TABLE II: Firms. Scaling of investments versus activity. Coefficients of linear fit and correlation 
coefficients. 

Yaxis m q corr.coef. 

a™ 0.925 0.026 0.974 

a°"* 0.607 0.644 0.997 
^in+out QJ39 Q_4gQ 0^992 



We notice that the exponent 0.925 for the scaling of hosted investment versus activity 
is smaller than, but still close to 1. This means that firms tend to host investment in 
amount almost just proportional to their activity. This is not surprising if we consider 
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FIG. 5: Firms, (left) Plot of total invested stock a*"+°"* of firms versus their activity a. (right) 
Plot of connectivity out-degree A;°"* of firms versus their hosted investment stock a°"* in log — log 
scale. Plot Data are binned. Mean it standard deviation of the values in each bin are plotted 
as continuous lines. 

that all firms in the data set analysed, are owned to some extent by some other firm in 
the data set and in many cases the share is large and close to 1. A deeper understanding 
would require an investigation of the statistics of the ownership concentration and will 
be carried out in a future work. On the other hand the exponent 0.61 for the scaling 
of outward investment versus activity implies that although more active firms tend to 
invest more, the investment increases less than linearly as a function of the activity (sub- 
linear increase). This means that very large and active firms invest less, in proportion, 
with respect to smaller ones. Such a finding becomes quite important if an institution in 
charge of attracting investments from firms is trying to estimate the expected investment 
of firms based on their activity. 

Finally, the exponent 1.57 for the scaling of activity versus out-degree is quite inter- 
esting. It implies that the larger the firm the larger the number of investments but the 
increase is sub-linear. Interestingly, the value of the exponent is not far from the values 
found for the scaling law between invested volume and degree in some stock markets: 1.1 
for Nasdaq, 1.43 for NYSE, 1.59 for MIB [8J. In that case the invested volume is exactly 
the analogous quantity to the outward investment. Moreover, the scaling law resembles 
the one observed inthe context of air traffic networks for a quantity analogous to the 
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outward investment which has been recently introduced as node strength . In that case 
the exponent found is 1.7. These findings might support the idea that a universal scaling 
law for node strength holds in complex networks where the weight plays a crucial role. 
In a future work we will investigate possible network formation models leading to the 
emergence of such scaling law in economic networks. 

TABLE III: Firms. Scaling of investments versus connectivity degree. Coefficients of linear fit 
and correlation coefficients 



Yaxis 


m q 


corr.coef. 


gOUt 


1.574 2.120 


0.855 



IV. ANALYSIS AT THE LEVEL OF REGIONS 
A. Regions. Distributions of activity, investment and connectivity degree 

In figure IHl (left) we report the cumulative distribution of activity A and in- 
ward/outward/total investment stock A*", ^*"+°"* for regions. These quantities 
were computed from the data set as described in section III Al As done for the case of 
firms, in order to check to what extent they overlap we also normalized such quantities 
and we report their cumulative distribution in figure IHl (right) . As it can be seen, the 
overlap is only partial. This may suggest that a non linear scaling holds between the 
variables and this hypothesis will be investigated in the next section. 

It must be remarked that the way regions are defined within a country surely depends 
on the country's administrative system. To give an example, the regions provided in the 
data set are at the level of 'provincia' for Italy and 'department' for France which have 
comparable surface on average. 

Differently from the case of the firms, the distributions do not display a linear decay 
in log — log scale but rather a quadratic one which we tried to fit with log-normal distri- 
butions. The fit with log normal is quite good although a discrepancy can be observed in 
the right tail. On the other hand, only the very last portion of the distribution could be 
fitted with a power law. Values for the coefficients of the log normal fit are given in table 



The Network of Inter- Regional Direct Investment Stocks Across Europe 



19 




FIG. 6: Regions. Cumulative distribution of: (left) activity A (for all firm profiles) (□); activity 
A (o); hosted investment stock ^4*" (x); outward investment stock (+); total investment 
stock ^«"+o"* (A) and (right) the same quantities normalized with respect to the maximum 
value. Zoom. 

TABLE IV: Regions. Log normal fit values for Cumulative Distributions. 

Data fi A/i a Aa 
A 9.06 0.12 1.97 0.08 

Aalldataset 9.79 0.10 1.82 0.07 

A^" 7.28 0.10 1.64 0.07 
6.89 0.14 2.02 0.10 

y^in+out 7 gg 0.10 1.76 0.07 

K 1.98 0.07 1.18 0.05 
K'"" 1.50 0.06 1.00 0.04 
1.52 0.08 1.16 0.06 

IIVI Just for sake of comparison we also report the exponent of the power law fit of the 
rightmost tail (table |V} 

At a first sight the finding that the distributions above are log normal is puzzling, 
as one may expect that region activity and investment stocks also scale with a power 
law. However, the following remark is relevant at this point. Cities ranges from small 
villages of few inhabitants to metropolis of 10-20 millions. Firms also range from one- 
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TABLE V: Regions. Power law fit values for Cumulative Distributions. 

Data 7 
A 2.29 0.14 

Aalldataset 2.23 0.09 

A'"" 2.58 0.19 
y^out 2.40 0.20 

j^tn+out 2.21 0.11 

K 2.53 0.09 
K*" 3.00 0.15 
j^out 2 AO 0.10 

person enterprises to multinationals with few hundreds thousands employees. On the 
other hand, while the position of the boundaries of a region surely is the result of the 
historical process, the surface and the amount of population within a region is probably 
limited by administration tasks constraints, in the sense that if the administrative load 
becomes too heavy, the region is split into two. For instance within a same country there 
are no regions which are orders of magnitude larger in surface than other regions. On the 
other hand, it is known that activity of countries measured by Gross Domestic Product 
is power law distributed with exponent 1 |7j. So it appears that regions are clearly less 
heterogeneous with respect to firms and countries. However before trying to draw some 
implications for economic development policies, it would be interesting to normalize the 
activity of the regions by the regions surface and/or active population. Unfortunately 
these data were not available during this work. 

In figure [7| we report the cdf of the connectivity degree for regions. All curves are 
clearly not power laws and we tried to fit them with log-normal functions. Values of 
coefficients are reported in table IIVI The curve for the in-degree is systematically above 
the one for the out-degree. Given the fact that the pdf is the derivative of the cdf, the 
plot implies that in the range [1 50] the in-degree is typically smaller than the out-degree 
while above 50 the opposite holds. We don't have an explanation for this finding. On 
the other hand we remind that as in the case of firms, the number of connections is not 
necessarily meaningful. In fact this is exactly why we have introduced the inward and 
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outward investment stock. 




K, k"", K°"^ 

FIG. 7: Network of Regions. Cumulative distribution of connectivity degree. In-degree (x) , 
out-degree total degree (A) 



B. Regions. Activity, investment and connectivity degree: correlations 

In order to understand why investment of regions are log normal distributed and why 
distributions of inward/outward investment and activity display a partial overlap after 
normalization (see section IIlB|) . we investigate the correlations between activity, invest- 
ment and connectivity in/out-degree of regions. With the same procedure used for the 
case of firms (see section iriIB|) . we produced the plots in figures |HE] (see section HTbI 

Tables IVIIVIII report the values of slope and correlation coefficients for the linear fit 
of the binned data. Again the correlation coefficients are all quite close to 1 so, with 
the caveat mentioned in section III Bl we conclude that the data indicate the existence 
of scaling laws in the network of regions between investment and activity and between 
connectivity degree and activity. 

We notice that the exponents 0.62 and 0.82 for the scaling of hosted investment versus 
activity and outward investment versus activity respectively are smaller than 1. 

As seen at the firm level, it follows that although more active regions tend to make 
and host more investment, the investment increases less than linearly as a function of the 
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FIG. 8: Regions. Plot of (left) hosted invested stock A*" and (right) outward invested stock 
j^out q£ j-ggiQj^g versus their activity A in log — log scale. Data are binned. Mean it standard 
deviation of the values in each bin are plotted as the continuous lines. 




A K 

FIG. 9: Regions. Plot of (left) total invested stock of regions versus their activity A. 

(right) Plot of total connectivity degree of regions versus their hosted investment stock 
j^m+out ^ log — log scale. Data are binned. Mean it standard deviation of the values in each bin 
are plotted as the continuous lines. 

activity (sub-linear increase). This means that very active regions invest, in proportion, 
less than smaller regions. However, the increase of outward investment with activity 
is stronger for regions (m = 0.82) than for firms (m = 0.61 see table IHI)- This is an 
interesting result that we will address in the future. 
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FIG. 10: Regions. Plot of connectivity (left) in-dcgree fc'" and (right) out-degree of regions 
versus their hosted investment stock and outward investment stock respectively in 

log — log scale. Data are binned. Mean it standard deviation of the values in each bin are 
plotted as the continuous lines. 

TABLE VI: Scaling of investments versus activity for regions. Coefficients of linear ht and 
correlation coefficients. 

Yaxis m q corr.coef. 
A^" 0.623 0.697 0.997 
A°"* 0.819 -0.420 0.990 
ji^in+out Q_74g Q_4oi 0.998 

The amount of investment made or received by regions in relation with their activity 
is relevant to institutions in charge of fostering development of regions. Although this 
findings cannot provide detailed prediction they could help develop multi-agent based 
models trying to reproduced the observed features with the aim of designing possible 
incentive strategies. 

Finally, the exponents for the scaling of activity versus in/out-degree are again not far 
from the values found for the scaling law in other works such as: between invested volume 
and degree in stock markets and for the scaling law of node strength versus degree in air 
traffic networks. 
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TABLE VII: Scaling of investments versus connectivity degree for regions. Cocflicients of linear 

fit and correlation coefficients. 

Yaxis m q corr.coef. 
A'"" 1.467 2.022 0.965 
1.370 2.268 0.965 
j^in+out i_326 2.201 0.979 



V. CONCLUSIONS 

In this paper wc propose a simple but novel procedure to build the network of inter- 
regional investment, based on the number of employees of firms in each region and their 
network of ownership. In this network representation, the connectivity in-degree of a 
region is the number of other regions from which firms invest in the focal region, while 
the out-degree is the number of regions in which firms of the focal region invest. The 
sum of the weights over the incoming links represents the hosted investment stock of a 
region in terms of employees. The sum over the outgoing links represents the outward 
investment stock of a region in terms of employees. 

We study the statistical properties of investment stock networks at the level of firms 
and at the level of regions. Our first result is that investment stock of firms is power 
law distributed and that, in particular, the exponent of outward investment is very close 
to the one of firm activity. As it is well known, this fact may result from a power law 
scaling relation between activity and outward investment. This is neither obvious nor 
documented in the literature, so it has to be checked empirically. At a first sight, activity 
and investment are quite scattered and span a few orders of magnitude. However, by 
taking the logarithm of the values of activity and investments, and binning the data, we 
indeed find that investment scales as a power of the activity. Moreover, power law scaling 
relations also hold between investment stock and connectivity degree. 

On the other hand, in the case of regions, we find log normal distributions for activity, 
investments and degree. Now, it can be argued that this result might simply be related 
to the distribution of population size across regions (unfortunately we do not have data 
to test this hypothesis at the moment). Even so, it is a remarkable fact that such proba- 
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bility distributions for the regions clearly differ both from those of firms as from those of 
countries in the world. The impact of this fact on the design of global policies to foster 
investments and economic development should be investigated. 

Again, the fact that similar distributions emerge for activity, investment and connectiv- 
ity degree of regions suggests that some relation should hold among them. In particular, 
we find that investment of firms scales as a power of the degree with exponent 1.57 (out- 
degree), while for regions it scales with exponents 1.37 and 1.47 (in-degree and out degree 
respectively). Interestingly, previous studies on different data set have investigated the 
scaling law of two quantities (invested volume and node strength) strictly analogous to 
the investment stock and found exponent values between 1.1 and 1.7 0,0]. The existence 
of scaling laws relating investment, activity and connectivity both in firms and regions 
is an interesting and novel result relevant to the fields of complex networks, industrial 
economics and geography. 

On the other hand, such scaling laws should be of interest for policy making. For 
instance we find that very active regions invest less, in proportion, with respect to smaller 
ones. The same holds for firms, although the coefficient governing the relation between 
investment and activity are different. This kind of result allows to make some statistical 
predictions about the investments that regions will receive or make, based on their activity 
and connectivity. The present work is a first step towards understanding the relation 
between the local dynamics of investment flows and the macro-economical facts emerging 
at a global level. One can ask for example whether such a distribution of investments is 
desirable with respect to some societal goals that might be at stake at the country or at 
the EU level. If it is not, one can investigate if introducing some incentive policies, the 
distribution can be improved with respect to the goals. In this sense, the findings reported 
here should stimulate the investigation of models for managing the development of regions 
and the optimal allocation of resources. Overall, we believe that these results open the 
way for further studies with potential long run implications in policy making at the level 
of EU investment promotion, support to underdeveloped EU regions and optimization of 
investment flow. 
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